Integrating Triangle and Jaccard similarities for recommendation
نویسندگان
چکیده
This paper proposes a new measure for recommendation through integrating Triangle and Jaccard similarities. The Triangle similarity considers both the length and the angle of rating vectors between them, while the Jaccard similarity considers non co-rating users. We compare the new similarity measure with eight state-of-the-art ones on four popular datasets under the leave-one-out scenario. Results show that the new measure outperforms all the counterparts in terms of the mean absolute error and the root mean square error.
منابع مشابه
A note on the triangle inequality for the Jaccard distance
Two simple proofs of the triangle inequality for the Jaccard distance in terms of nonnegative, monotone, submodular functions are given and discussed. The Jaccard index [8] is a classical similarity measure on sets with a lot of practical applications in information retrieval, data mining, machine learning, and many more (cf., e.g., [7]). Measuring the relative size of the overlap of two finite...
متن کاملImproving Neighborhood-Based Collaborative Filtering by a Heuristic Approach and an Adjusted Similarity Measure
“Collaborative filtering” is the most used approach in recommendation systems since it provides good predictions. However, it still suffers from many drawbacks such as sparsity and scalability problems especially for huge datasets which consist of a large number of users and items. This paper presents a new algorithm for neighborhood selection based on two heuristic approaches. The first of whi...
متن کاملInvestigating the Factors Affecting the Whistle-blowing Intentions Using the Fraud Triangle and the Theory of Planned Behavior: An Accountant's Perspective
Nowadays, whistle-blowing is an important mechanism for detecting and preventing fraud. The purpose of the present research is to investigate factors that influence accountants’ intentions of whistle-blowing by integrating the fraud triangle components, as the external environmental factors that determine whistleblowing intention, and the extended theory of planned behavior, as the individual i...
متن کاملAutomatic Hashtag Recommendation in Social Networking and Microblogging Platforms Using a Knowledge-Intensive Content-based Approach
In social networking/microblogging environments, #tag is often used for categorizing messages and marking their key points. Also, since some social networks such as twitter apply restrictions on the number of characters in messages, #tags can serve as a useful tool for helping users express their messages. In this paper, a new knowledge-intensive content-based #tag recommendation system is intr...
متن کاملارائه یک الگوریتم خوشه بندی برای داده های دسته ای با ترکیب معیارها
Clustering is one of the main techniques in data mining. Clustering is a process that classifies data set into groups. In clustering, the data in a cluster are the closest to each other and the data in two different clusters have the most difference. Clustering algorithms are divided into two categories according to the type of data: Clustering algorithms for numerical data and clustering algor...
متن کامل